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DETAILED ACTION 

1 . This action is responsive to communications: The RCE filed 03/23/06. 

2. Claims 1, 6-7, 10, and 16-17 have been Amended. 

3. Claims 19 and 20 have been added as necessitated by Amendment. 

4. Claims 1-3, 6-9, 10-12, and 15-18 remain rejected under 35 U.S.C. 103(a) as being 
unpatentable over Russell-Falla et al (US-6,675,162 01/06/04) in view of Chakrabarti et al (US- 
6,389,436 05/14/02). 

5. Claim 4-5 and 13-14 remain rejected under 35 U.S.C. 103(a) as being unpatentable over 
Russell-Falla et al (US: 6,675,162 01/06/04) in view of Chakrabarti et al (US-6,389,436 
05/14/02) in further view of Haug et al (US: 6,556,964 04/29/03). 

6. Claims 1-20 are pending in the case. Claims 1 and 10 are independent claims. 

Claim Rejections - 35 USC § 103 

7. The following is a quotation of 35 U.S.C 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

8. Claims 1-3, 6-9, 10-12, and 15-18 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Russell-Falla et al (US-6,675,162 01/06/04) in view of Chakrabarti et al (US- 
6,389,436 05/14/02). 

-In regard to independent claims 1 and 10, Russell-Falla et al teach a method and 
apparatus for determining content type of a web page comprising: 
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providing a predefined set of potential content types (categories of content)(column 2, 
lines 35-43); 

for each potential content type (categories of content)(column 2, lines 35-43)(e.g. 
"pornographic", "racist", etc)(column 3, lines 39-43), preparing a distinguishing series of binary 
tests (column 2, lines 56-63; column 3, lines 39-57; column 4, lines 61-66)(i.e. testing each 
keyword or regular expression of the content against a database of keywords and regular 
expressions common to the content type for matching and weighting purposes), wherein at least 
one test determines whether a predefined piece of data or keyword appears in URLs (column 2, 
lines 5-9) of the subject Web page (column 2, lines 56-63; column 3, lines 23-57, column 4, lines 
61-66: i.e. testing all of a web pages textual content which as appreciated by one skilled in the art 
would include the text in the URLs of the selected web page); 

for each potential content type (categories of content) (column 2, lines 35-43)(e.g. 
"pornographic", "racist", etc)(column 3, lines 39-43), running the distinguishing series of tests 
(column 2, lines 56-63; column 3, lines 39-57; column 4, lines 61-66)(i.e. testing each keyword 
or regular expression against a database of keywords and regular expressions common to the 
content type for matching and weighting purposes) enabling quantitative evaluation of some 
contents of the selected web page being of the potential content type (column 2, lines 55-64); 

mathematically combining the test results (column 3, lines 54-57); and 

based on the results, assigning a probability (equivalent to the final rating of the page 
relative to the content category), for each potential content type, that shows the likelihood that 
some contents of that type exist on the selected web page (column 3, lines 2-6). 
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Russell-Falla et al do not teach wherein the distinguishing series of tests includes at least 
one or more non-binary tests. Chakrabarti et al teach a distinguishing series of tests to determine 
the potential content type of a web page (column 4, lines 5-15), wherein the distinguishing series 
of tests include non-binary tests (i.e. the test results in more than two possible outcomes)(column 
6, lines 27-67; column 7, lines 1-4). It would have been obvious to one of ordinary skill in the 
art at the time of the invention for Russell-Falla et al to have run the additional non-binary tests 
as taught in Chakrabarti et al for determining the content type of web pages, because Chakrabarti 
et al teach that by utilizing in/out links of the web page with the hypertext classifier (Fig. 1 : 110) 
the accuracy of classification goes up over those tests utilizing only local text/terms of the 
document to be classified (column 7, lines 34-59). 

-In regard to dependent claims 2, 11, and 15, Russell-Falla et al further teach wherein 
the set of potential content types could include web page articles/news with information about 
people (e.g. pornography, racism, terrorism) and other content (column 2, lines 10-23; column 3, 
lines 41-43). 

-In regard to dependent claims 3 and 12, Russell-Falla et al further teach producing a 
respective confidence level (equivalent to the rating of the page relative to the content category) 
for each potential content type when at least some of the web page content was of that type 
(columns 2 & 3, lines 54-67 & 1-6). 



Application/Control Number: 09/768,869 Page 5 

Art Unit: 2178 

-In regard to dependent claims 6 and 16, Russell-Falla et al further teach wherein the 
step of running the tests includes determining whether a predefined piece of data or keyword 
("weighting list") appears in the web page (column 2, lines 56-63). 

-In regard to dependent claims 7 and 17, Russell-Falla et al further teach wherein the 
step of running the tests includes determining whether a predefined piece of data or keyword 
("weighting list") appears in the web page (column 2, lines 56-63). 

-In regard to dependent claims 8, 9, and 18, Russell-Falla et al do not teach storing 
indications of the assigned probabilities (web page ratings) of each potential content type cross 
referenced with each respective web page in a database. It would have been obvious to one of 
ordinary skill in the art at the time of the invention to have stored previously viewed web pages 
along with there respective ratings for content types local to the user, because it was well known 
in the art at the time of the invention that storing frequently view web pages with their ratings 
would significantly reduce the determination/processing time of the Russell-Falla et al system by 
eliminating undue identifying, analyzing, and calculating on identical web page requests. Thus a 
repeated request could render an appropriate web page more efficiently which would benefit 
Russell-Falla et al which teach that analyzing web pages could be difficult and time consuming 
(column 1, 38-43). 

-In regard to dependent claims 19 and 20, Russell-Falla et al teaches wherein one of 
tests was determining the number of phrases that contain the keyword (column 2, lines 56-63; 
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column 3, lines 39-57; column 4, lines 61-67; column 5, lines 1-22: i.e. the number of keywords 
from the selected web page that have a corresponding entry in the pre-existing database are 
placed in a weighted list and then summed together to determine a rating to be compared to a 
given threshold). 

9. Claim 4-5 and 13-14 are rejected under 35 U.S. C. 103(a) as being unpatentable over 
Russell-Falla et al (US: 6,675,162 01/06/04) in view of Chakrabarti et al (US-6,389,436 
05/14/02) in further view of Haug et al (US: 6,556,964 04/29/03). 

-In regard to dependent claims 4 and 13, Russell-Falla et al further teach wherein the 
test results utilize a neural network (column 4, lines 1-5). Russell-Falla et al do not teach 
wherein the combining of the test results includes using a Bayesian network. Haug et al teach 
wherein the application of a Bayesian network for statistical pattern recognition provides 
improved system performance with additional training of the network (column 3, lines 8-16). It 
would have been obvious to one of ordinary skill in the art at the time of the invention, for the 
invention of Russell-Falla et al to have employed a Bayesian network as shown in Haug et al, to 
achieve the above mentioned improved system performance, because Russell-Falla et al do 
provide the needed training of the network (column 3, lines 58-67) which would be needed to 
increase the statistical recognition needed to support the Bayesian network. 

-In regard to dependent claims 5 and 14, Russell-Falla et al further teach the step of 
training the neural network using a training set of web pages with respective known content 
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types and collecting the statistics on the test results of the training web pages (column 3, lines 
58-67). 

Response to Arguments 

10. Applicants arguments filed 03/01/06 have been fully considered but they are not 
persuasive. 

With regard to the Applicant's arguments regarding the independent claims, the 
Examiner respectfully disagrees with the Applicant that said amendments would be allowable 
over the prior art of record While the Examiner agrees with the Applicant that the Russell-Falla 
reference fails to teach or suggest testing involving syntax, grammar, or style, the Examiner does 
not agree that Russell-Falla does not teach analyzing the URLs of the subject web page. Russell- 
Falla clearly teaches wherein the analyzed web pages were standard HTML documents that 
included URLs for linking to additional documents. Russell-Falla also teaches wherein all of a 
documents textual content was broken down and compared to a pre-existing database to 
determine if any of the content matched entries in the database. It would have been appreciated 
by one of ordinary skill in the art that HTML URLs contained textual data that indicated in some 
way to the web page viewer the content of the linked web page. In regard to the newly added 
claims the Examiner notes that Russell-Falla clearly teaches placing each instance of a 
recognized keyword in the web page into the weighted list and then summing each instance to 
eventually be compared to a predefined threshold. 
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Conclusion 

1 1 . The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure. 

US-6,621,930 09-2003 Smadja, Frank 

US-6,647,396 1 1-2003 Parnell et al 

12. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Adam L. Basehoar whose telephone number is (571)-272-4121. 
The examiner can normally be reached on M-F: 7:00am - 4:00pm. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Steve Hong can be reached on (571) 272-4124. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 
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